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Abstract 

The absence of a globally nonsingular three-parameter representation of rotations 
forces attitude Kalman filters to estimate either a singular or a redundant attitude 
representation. We compare two filtering strategies using simplified kinematics and 
measurement models. Our favored strategy estimates a three -parameter 
representation of attitude deviations from a reference attitude specified by a higher- 
dimensional nonsingular parameterization. The deviations from the reference are 
assumed to be small enough to avoid any singularity or discontinuity of the three- 
dimensional parameterization. We point out some disadvantages of the other 
strategy, which directly estimates the four-parameter quaternion representation. 

Introduction 

Real-time spacecraft attitude estimation generally employs an Extended Kalman 
Filter (EKF) [1,2]. Although the 3x3 orthogonal attitude matrix is the fundamental 
representation of the spacecraft’s attitude, the orthogonality requirement imposes 
six constraints on its nine elements, reflecting the fact that the special orthogonal 
group SO(3) of rotation matrices has dimension three. Therefore, most EKFs use 
lower-dimensional parameterizations of SO(3), with the earliest using a minimal 
three-dimensional parameterization [3]; but higher-dimensional parameterizations 
can avoid the singularities or discontinuities present in all three-parameter 
representations [4]. The four-component quaternion has the lowest dimensionality 
possible for a globally non-singular representation of SO(3), but it still has one 
superfluous degree of freedom. Thus we face the dilemma of using an attitude 
representation that is either singular or redundant. Our preferred strategy for 
evading this dilemma uses a nonsingular representation for a reference attitude and 
a three-component representation for the deviations from this reference. This 
method, which we refer to as the Multiplicative EKF (MEKF), was implicit in some 
early spacecraft attitude estimators [5-8] and has been discussed in detail in 
Reference 9. An alternative strategy, the Additive EKF (AEKF), treats the four 
components of the quaternion as independent parameters [10]. 

We begin with a brief overview of quaternion estimation, emphasizing the 
conceptual difficulties. The MEKF is then discussed in a model with simplified 
kinematics and measurements. This is followed by an analysis of two versions of 
the AEKF, using the same model as the MEKF, and by a summary of our 
conclusions. Reference 1 1 is an extended version of this paper. 



Quaternion Estimation 

The attitude matrix is generally written as a homogeneous quadratic function of the 
quaternion [12], 

A(q) = A\ [ q 1 = {q} -|<f )/ + 2qq T -2ft[qx], (1) 
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where I denotes the 3 x 3 identity matrix, and the cross product matrix is 

[ 0 -q 3 q 2 

[qx]« q 3 0 -ft . (2) 
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We use the quaternion product convention of References 8 and 12 so that 

(\ p 1 \ q]\ (\ p, q + ^P-pxqTl _ 

A(p®q) = A ® -A (3) 

\[p*\ M) {[ PaQa -p*q \) 

A conventional “additive’' quaternion filter defines the estimate g and error Ag by 
qs E{q} and Aq = q -q , (4) 

where £{•} denotes the expectation. This means that that 

=||f + £{|A?| 2 }>(|| . (5) 

Equation (1) gives an orthogonal matrix only if the quaternion has unit norm. 
Equation (5) shows that if the random variable q has unit norm, the norm of its 
expectation must be less than unity. Although the unity norm constraint violation 
will be on the order of the variance of the attitude errors, we regard this as a serious 
conceptual problem with additive quaternion EKFs, so we turn first to the MEKF. 

Multiplicative EKF 

The MEKF represents the attitude as the quaternion product 

q = Sq(B)® q ref , (6) 

where q ref is some unit reference quaternion and the rotation <5? (a) from q ref to the 
true attitude q is parameterized by a three-component vector a. Although several 
choices for a are possible [9], it is only important for this paper that for small 
rotations 

8q( a) * 1 + ^a + orderfaf ) , (7) 

where 1 denotes the identity quaternion and a denotes a quaternion with vector 
part a and scalar part 0. The two attitude representations Sq and q r ' ef in Eq. (6) are 

clearly redundant. The basic idea of the MEKF is to compute an unconstrained 
estimate of the three-component a while using the correctly normalized four- 
component q ref to provide a globally nonsingular attitude representation. Given an 



estimate a = E{ a} of a, Eq. (6) says that dq(a)® q ref is the corresponding estimate 
of the true attitude quaternion q . We remove the redundancy in the attitude 
representation by choosing the reference quaternion q ref so that a is identically 
zero, which means that 6q(0) is the identity quaternion and the reference 
quaternion is the estimate of the true quaternion. The identification of q ref as the 

attitude estimate means in turn that a is a three-component representation of the 
attitude error. This provides a consistent treatment of the attitude error statistics, 
with the covariance of the attitude error in the body frame represented by the 
covariance of a. The fundamental conceptual advantage of the MEKF is that q ref is 

a unit quaternion by definition. 

Kinematics 

The quaternion kinematic equation is 

q~\<5®q, ( 8 ) 

where co is the angular velocity vector in the body frame. Since q nf is also a unit 
quaternion, it must obey 

Qr'f = (9) 

where to ref , the angular velocity of the reference attitude, must be determined by 
the requirement that a be identically zero. Note that q ref and co ref are not random 
variables. Computing the time derivative of Eq. (6), using Eqs. (8) and (9), gives 

Sq~ \{(5®dq-Sq®(5^) y (10) 

after right-multiplying the entire equation by the inverse of q ref and rearranging. 


We now assume the simple kinematic model 

co (0 =co (0 + n w (r), ( 11 ) 

where co ( t ) is the nominal angular velocity, and the zero-mean Gaussian process 
noise obeys 


with 5(t - t r ) denoting the Dirac delta function. Substituting Eq. (10) into the time 
derivative of Eq. (7) and linearizing in a and n a gives 

a « co - co + F a a + G a n fl) (13) 


with 

K "-iK© +ttr,/)x] and G fl s / . (14) 

Measurements 

We model the measurement of a vector \ B in the body coordinate frame by 

z-v a «h(a) + n 2 = A@V/+n 2 , ( 15 ) 


where v, is the corresponding vector in the inertial frame, and the zero-mean 
Gaussian measurement noise n z is assumed to be isotropic, 

R “ £{n z iij} - o\l . (16) 

Substituting Eqs. (1), (3), (6), and (7) into Eq. (15) and linearizing in a gives 

h(a)«v*+ff fl a, (17) 

where \ B is the predicted measurement in the body frame, 

▼j- A®*)*/. (18) 

and the measurement sensitivity matrix is ~ 

(19) 

The attitude error estimate a is propagated by the expectation of Eq. (13) with 
a = 0, conditioned on the measurements [1], 

a = o5 -a> rtf + P a H T a R~'[z- h(0)] - cb - <o ref - a' t 2 P„ y b xv b , (20) 

where P a is the covariance of the error vector a. This is propagated using the 
approximation co rtf/ « to , which neglects only terms of higher order than are 

customarily retained in an EKF and gives 

P a - F a P a + P a Fj + G a QG T a - P a H T a R-'H a P a 

- -[cox]P a - P.[(Sx] r + a 2 J -o~ 2 P a [ v s x ] r [ v s x ]P a . ; 

Since the MEKF requires that a be identically zero, Eq. (20) gives to ref as 

(0 re/ = (0 + o; 2 P a (Vg X v B ) . (22) 

The first term embodies the assumed kinematics, while the second incorporates the 
measurement updates. Equations (9) and (22) preserve the unity norm of the 
reference quaternion. No reset operation is needed with continuous measurements, 
in contrast to the discrete measurement case [9]. We emphasize that the MEKF does 
not require integration of Eq. (20), which is only used to derive Eq. (22). 

Additive EKF 


The AEKF relaxes the quaternion normalization condition and treats the four 
components of the quaternion as independent parameters, with the quaternion 
estimate and error given by Eq. (4). Linearizing Eqs. (8) and (11) gives the error 
vector kinematics 


with 
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-to 0. 


-q J 


(23) 


( 24 ) 



The quaternion in an AEKF has uncertainties in both angle and norm. Angle 
uncertainties have the form given by Eq. (6), except that that the role of q ref in the 
MEKF is taken by q in an AEKF. Norm errors are parallel to q , so we can write 

q = 5q{a)®q + ffi «[s f] 2 ® +^f = f + [s f] 2 * . (25) 

The approximate equality follows from the product rule and Eq. (7). Note that the 
norm error /? is a relative error (multiplying the norm of q by 1+/1) rather than an 
absolute error (adding (3 to the norm of q ). Then the 4x4 quaternion covariance is 

P q E ^{Aq &q T } = [h fl [ \ P ° ^ Pc l[E ff, - (26) 

L W« Pn J L J 

where P a is the 3x3 attitude covariance, p n is the quaternion norm variance, and p c 
is the vector of attitude/norm covariances. This factorization is not used in 
unconstrained quaternion estimators, which directly estimate the four-component 
A q and the 4x4 P q . 

Measurements 

The AEKF measurement model is very similar to the model in the MEKF, given by 
Eq. (15). The AEKF uses a different parameterization of the attitude matrix A(q ) , 
however, and h is regarded as a function of the quaternion q rather than of the 
attitude error vector a. We will consider two different implementations of the 
AEKF. The first, which we call the quadratic AEKF, uses Eq. (1) for A(q). We 
have seen that this attitude matrix is not orthogonal unless the quaternion has unit 
norm, but we will show that these measurements drive the norm to unity in the 
quadratic AEKF. The second implementation uses the orthogonal attitude matrix 
formed by using the normalized quaternion q/\q\ in Eq. (1) 

A R (q) = A(q/^\)^\' 2 A(q). (27) 

We refer to this as the ray representation AEKF because any non-zero quaternion 
along a ray in quaternion space (a straight line through the origin) represents the 
same attitude. The twofold ambiguity of the unit quaternion representation 
corresponds to the two points where the ray pierces the unit sphere. After a good 
deal of algebra, we find the measurement sensitivity matrix 

H g *dhldq\ r 2$\~ 2 ([v B x]E T +kv B ¥), (28) 

where \ B is the predicted measurement given by Eq. (18) with q ref replaced by q , 

and where k = 1 for the quadratic AEKF and k = 0 for ray representation AEKF. 
The form of the measurement sensitivity matrix implies that quaternion norm 
errors, which are parallel to q , are observable in the quadratic AEKF, but not in the 
ray representation AEKF. 



(29) 


The expectation of Eq. (8) conditioned on the measurements is 
3-Lu®?+P q H T q R-'[ z-h(f)] 

- 2 ©«jr ® f + ^[(Vs x p c + 2kp n y s ) ■ (y B -v B )]q, 

where 

co <#S (»+a z ' 2 {F 0 (v fi xv B ) + 2 ( t[v fi -(v fl -v s )]p c }. (30) 

The term parallel to q on the right side of Eq. (29) modifies the norm of the 
quaternion estimate, but it is not difficult to show that 

d( #l$>l dt • (31) 

Since co^ in the ray representation AEKF ( k = 0) is the same as co ref in the MEKF, 

and since the ray representation AEKF uses the normalized quaternion to compute 
the attitude via Eq. (27), it gives the same attitude estimate as the MEKF. The ray 
representation AEKF was used to estimate the attitude of the ALEXIS and CAPER 
spacecraft [13, 14]. 

The quadratic AEKF (k= 1) is different. Since the attitude matrix depends on the 
quaternion norm in this representation, the term parallel to q on the right side of 
Eq. (29) is significant. If q is not a unit quaternion and p n is not zero, \ B - v a will 
have an error along \ B that drives the norm of q toward unity. The angular update 
given by Eq. (30) is the same as for the ray representation AEKF and MEKF if p c 
is zero, but is different if p c is not zero. It is difficult to avoid the conclusion that 
the quadratic representation AEKF gives incorrect estimates in the latter case. 
Substituting Eqs. (26) and (28) into the covariance propagation equation 

p q = FqPq + P q F q T + G q QG\ -P q H T q R-'Hf q . (32) 

gives, after some algebra, 

Pa --{Mxy* -P.[axY +a 2 /-a; 2 (p o [v B xf[v i ,x]/ > 0 + 4A:|v fi | 2 p c p:) > (33a) 

Pc = -to x p c -cr; 2 (p o [v B xf [v B x] + 4fc|v e f p„)p c> (33b) 

Pn =-cr; 2 (|v B x p c | 2 +4fc|v B fpj. (33c) 

Equation (33a) shows that the 3x3 attitude covariance P a obeys the same equation 
in the ray representation AEKF ( k = 0) as in the MEKF. Equation (33b) shows that 
p c is zero for all time in either AEKF if its initial value is zero. Equation (26) shows 
that this is the case if the initial estimate q is an eigenvector of the initial 
covariance; this includes the case that the initial P q is a multiple of the identity. The 
measurement term in Eq. (33b) will drive p c to zero if it is not initially zero. In the 
ray representation AEKF, the norm variance p n is constant if p c is zero, which may 


cause loss of significance of the attitude covariance if the attitude estimate 
converges to small errors, since P a is mixed with p n in the quaternion covariance P q . 

The attitude covariance P a in the quadratic AEKF ( k = 1) is different from the other 
two filters unless p c is zero. The last term in Eq. (33c) drives p n to zero in the 
quadratic AEKF, so the 4x4 covariance matrix becomes singular with null vector 
q . The resultant potential instability of the quadratic AEKF can be avoided only by 
adding unphysical process noise to the quaternion magnitude. Note that the 
convergence of j^| to unity and of p n to zero is a result purely of measurement 

processing, requiring neither “brute force” normalization of the quaternion nor 
norm-enforcing pseudo-measurements. Such subterfuges have been found useful in 
the presence of norm errors arising from numerical and discretization effects in 
AEKF implementations, however [10]. 

Discussion 

Of the methods considered in this paper, the MEKF requires the least computational 
effort due to the lower dimensionality of its covariance matrix. The MEKF is also 
the most satisfying conceptually, since it respects the dimensionality of the rotation 
group and its attitude estimate is a unit quaternion by definition. 

The ray representation AEKF is conceptually simpler than the MEKF, since it 
requires no attitude parameterization other than the quaternion, and it gives the 
same attitude estimates and covariance at the cost of some additional computational 
burden. Numerical significance issues may arise from the unobservable quaternion 
norm degree of freedom, but these have not caused difficulties' in applications. 

The quadratic AEKF rests on less secure foundations. We have shown that the 
quaternion approaches unit norm, and thus the attitude matrix approaches an 
orthogonal matrix, as a natural result of measurement processing, without resorting 
to “brute force” normalization of the quaternion or norm-enforcing pseudo- 
measurements. If the initial quaternion estimate is an eigenvector of the initial 4x4 
quaternion covariance, the attitude errors and quaternion norm errors are 
uncorrelated for all time, and the attitude covariance and attitude estimates are the 
same as the other filters. If the correlation between the attitude errors and 
quaternion norm errors is not zero, the attitude update and attitude covariance differ 
from those of the other filters. The estimates provided by this method should be 
regarded with caution in either case, since the attitude matrix is not exactly 
orthogonal and the 4x4 covariance matrix becomes singular, potentially leading to 
instability that can be avoided only by adding unphysical process noise to the 
quaternion magnitude. 

The motivation for considering the additive EKFs, despite the conceptual and 
computational advantages of the MEKF, appears to be their superficial resemblance 
to a linear Kalman filter. This resemblance is deceiving, because the process noise 
and any dynamic parameters to be estimated enter mutiplicatively rather than 
additively in the quaternion kinematics equation. Realistic measurement models are 
also nonlinear, so we see no valid reason to prefer an AEKF to the MEKF. 
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